Feedback-control of quantum systems using continuous state-estimation 
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We present a formulation of feedback in quantum systems in which the best estimates of the 
dynamical variables are obtained continuously from the measurement record, and fed back to control 
the system. We apply this method to the problem of cooling and confining a single quantum degree 
of freedom, and compare it to current schemes in which the measurement signal is fed back directly 
in the manner usually considered in existing treatments of quantum feedback. Direct feedback may 
be combined with feedback by estimation, and the resulting combination, performed on a linear 
system, is closely analogous to classical LQG control theory with residual feedback. 
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I. INTRODUCTION 



The continuous measurement of quantum systems 
has been a topic of considerable activity in recent 
years PHl(| , and is particularly relevant at this time be- 
cause experimental technology is now at the point where 
individual quantum systems can be monitored continu- 
ously jfl). With these developments it should be pos- 
sible in the near future to control quantum systems in 
real time by using the results of the measurement in the 
process of continuous feedback. 

A theory describing the dynamics that results from 
feeding back the measurement signal (usually a photocur- 
rent) at each instant to control the Hamiltonian of a 
quantum system has been developed by Wiseman and 
Milburn |12|. They have shown how to derive the re- 
sulting Stochastic Master Equation (SME) for the condi- 
tioned evolution, and the corresponding unconditioned 
master equation, both of which are Markovian in the 
limit of instantaneous feedback. This kind of feedback 
has already been used to reduce laser noise below the 
shot-noise level Q . 

However, there are many ways in which the measure- 
ment signal may be fed back to affect the system. In 
general, at a given time, any integral of the measure- 
ment record up until that time may be used to alter 
the system Hamiltonian and affect the dynamics. This 
leads, however, to a non-Markovian master equation, and 
the resulting dynamics cannot therefore be easily investi- 
gated, and, more importantly, understood. Nevertheless, 
as we shall examine in this paper, certain integrals of the 
measurement record provide specific information, such as 
the best estimates of dynamical variables. These may be 
fed back to alter the system evolution in a desired way, 
and while the unconditioned evolution of the system is 
no longer Markovian, simple equations may be derived 
for the selective evolution of system variables, and cor- 
respondingly simple non-linear (but Markovian) SME's 
describe the evolution of the system in the limit of in- 
stantaneous feedback. This approach to quantum feed- 
back has close analogies to that used in classical control 
theory, in particular that control is broken down into a 



state-estimation step and a feedback step. Because of 
this, classical results regarding the design of feedback 
loops can be applied, opening up new possibilities for 
controlling quantum systems. 

In this paper we consider a single quantum degree 
of freedom, which could be, for example, a trapped 
atom ion fill or a moving mirror forming one end 
of an optical cavity fl6f| , subjected to continuous posi- 
tion observation. Naturally the continuous observation of 
position actually corresponds to a continuous joint mea- 
surement of both position and momentum, because mo- 
mentum information is implicit in the observed change in 
the position over time. We show how the best estimate of 
both the position and momentum at each point in time 
may be obtained from an integral of the measurement 
signal when the initial state of the system is Gaussian. 

We examine the dynamics which results from using the 
best estimates of the system variables in a feedback loop, 
and in particular investigate cooling and confinement us- 
ing this mechanism. We also apply the Wiseman-Milburn 
direct feedback theory to investigate the implementation 
of cooling and confinement by feeding back the measure- 
ment signal at each time, a technique which has been 
considered by Dunningham et al. |17| and Mancini et 
al. jl8| , and contrast this with the method involving es- 
timation. 

In the next section we review briefly how the SME 
describing a continuous measurement of position results 
from real physical measurement schemes. In section [II 
we review the solution of this master equation, which 
may be obtained in a simple manner for initially Gaussian 
states, and give the integrals that are required to mea- 
sure both position and momentum (ie. to obtain the best 
estimates of position and momentum at each time). We 
then examine the dynamical equations which result from 
using the best estimates for the purposes of feedback, and 
present the classical control theory which may applied to 
this quantum feedback process due to an equivalence with 
classical estimation theory. In section IV we consider the 
problem of cooling and confinement using feedback. We 
apply both feedback by estimation and direct feedback 
to this problem. Section [y] concludes. 
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II. CONTINUOUS POSITION MEASUREMENTS 

A. Two physical position measurement systems 

A continuous measurement of the position of a macro- 
scopic object may be obtained by observing continuously 
the phase of a light beam reflected from it. If we allow 
the object in question to form one end-mirror in an op- 
tical cavity, then in the limit in which one of the cavity 
mirrors is very lossy (the bad-cavity limit), the phase of 
the light output from the cavity provides a continuous 
measurement of the position of the moving mirror, since 
the light spends little time in the cavity |l9|;^0| . This is a 
simple way to treat position measurement by light reflec- 
tion, and what is more, the position of a single atom may 
also be monitored continuously in the same manner. To 
monitor the position of a single atom, the atom is allowed 
to interact off-resonantly with the optical cavity mode, 
and this interaction is such that the atom generates a 
phase shift of the output light in a manner similar to the 
moving end-mirror. We now examine briefly these two 
situations, and derive the SME describing the measured 
systems. 

The Hamiltonian describing an optical cavity in which 
one of the mirrors is free to move along the axis of the 
cavity is 

H = H m - Tig m a)ax + H d (1) 

where a is the annihilation operator for the cavity mode, 
g m = ojq/L is the coupling constant describing the inter- 
action between the cavity mode and the moving mirror 
(in which ojq is the mode frequency, and L is the length 
of the cavity) , H m is the Hamiltonian for the mechanical 
motion of the mirror, and H d describes the coherent driv- 
ing of the cavity mode. Note that we have moved into the 
interaction picture with respect to the free Hamiltonian 
of the cavity mode. In deriving this Hamiltonian it is 
assumed that the cavity mode follows the motion of the 
mirror adiabatically, and in particular that the change 
in the cavity length due to the motion of the mirror is 
small compared to the cavity length itself [^T|. That is, 
| (a;) | <C L. One of the end- mirrors is chosen to be lossy 
so as to provide output coupling, and the cavity is driven 
through this mirror. The part of the Hamiltonian which 
describes coherent driving of the cavity is given by 

H d = ihE(a- a 1 ), (2) 

w here E is related to the laser power P by E = 
y/ r yP/(fkiJo), and 7 is the decay rate of the cavity due 
to the output coupling mirror |20| . 

We see that the Hamiltonian describing the interac- 
tion between the mirror and the cavity field is of the 
form a) ax. This is exactly what we need in order to ob- 
tain a continuous position measurement by monitoring 
the phase of the output light. This is because at a is the 
generator of a phase shift for the light, and therefor a 



Hamiltonian of this form produces a phase shift propor- 
tional to the position of the mirror, which is exactly what 
is required. 

The Hamiltonian describing the off-resonant interac- 
tion between a two-level atom, and an optical cavity in 
which it is trapped, is |22| 

H = Ha-h^ a cos 2 (k x)+H d (3) 

where fco = ^o/c is the wavenumber of the cavity mode, 
A is the detuning between the cavity mode and the two 
level atom (A = uj a — u>q, where iv a is the frequency of the 
atomic transition), x is the atomic position operator, go 
is the cavity-QED coupling constant giving the strength 
of the interaction between the cavity mode and the atom 
and H a is the Hamiltonian for the mechanical motion of 
the atom. We will assume the atom to be harmonically 
trapped, which might be achieved by using a second light 
field |ll|] , or by ion tapping |23j] . 

To obtain a continuous position measurement by mon- 
itoring the phase of the output light, we require the inter- 
action of the atomic motion and the cavity mode to be of 
the same form as that for the mirror. To achieve this we 
need simply ensure that the atom is trapped in a region 
small compared to the wavelength of the light, about a 
region halfway between a node and an anti-node so that 
we may approximate cos 2 (fco^) = cos 2 (fco^o + kox') as 
1/2 + kox' . Renaming X 3iS X (merely a shift in which 
the resulting extra term in the Hamiltonian is unimpor- 
tant), we obtain the correct interaction Hamiltonian. 

To realize a position measurement the phase quadra- 
ture of the output light must be monitored, and we 
choose homodyne detection since it provides the simplest 
treatment. Performing homodyne detection of the phase 
quadrature with a detector efficiency rj, the SME describ- 
ing the evolution of the system conditioned on the con- 
tinuous measurement record is HH (see also [§|H|c|), 

dp c = -i[H> Po]dt + "fV[a] Pc dt + ^H[-ia] Pc dW, (4) 

where p c is the system density matrix conditioned on 
the measurement record, dW is the Wiener increment, 
satisfying the Ito calculus relation (dW) 2 = dt, and the 
superoperators T> and TL are given by 

2V[c]p c = 2cp c c t - c'cp c - pcc'c, (5) 
H[c\p c = cp c + p c J - Tr[cp c + PcC^Pc, (6) 

for an arbitrary operator c. 

B. Adiabatic elimination of cavity dynamics 

We are interested only in the dynamics of the atom 
(or equivalently the mirror), and we are also interested 
purely in the bad-cavity limit (large 7) which corresponds 
to good position measurement . In this limit, due directly 
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to the high cavity damping rate, the cavity mode is slaved 
to the atom dynamics, and can therefore be adiabatically 
eliminated to obtain a SME purely for the atom. To do 
this we proceed by following essentially the treatment in 
reference 

Noting first that in the absence of the interaction with 
the atom, the steady state of the cavity mode is the co- 
herent state | a = —2E/"f) , we transform the system using 



D(-a)p c D\-a) 



(7) 



where D(a) is the displacement operator, such that 
D(a)\0) = | a) ^2|. In this 'displacement picture', the 
steady-state of the cavity is now close to the vacuum, 
with the SME being 



d P ' c 



h 



[H m — hg(aJa + a(a + a') + \a\ 2 )x 1 p' c ]dt 



+jV[a]p c ,dt + y^rfH[-ia]p c >dW. 



The regime required for adiabatic elimination is 



<?(M 2 + i)K*>| 



= e < 1, 



(8) 



(9) 



where e will be our small parameter governing the ap- 
proximation. Here g = g m for the case of the mirror, 
or g = fcoSo/A for the atom. Relation (|J) also im- 
plies that (ga\(x)\/j) < e, so this quantity also serves 
as the small parameter. To proceed we assume that 
the elements of the cavity mode density matrix in the 
number basis, p'™ m , scale with the small parameter e as 
pinm ^ £ (n+m)^ anc j we w yj snow that this is consistent 

with the regime (^). Under this assumption, the state of 
the cavity+ (atom/mirror) may then be expanded up to 
second order in e as 

Pc= ^o|0)(0| + (p? |l)(0| + H.c.) 

+P?i|l){l| + (p1 |2)(0| + H.c.) + 0(e 3 ), (10) 



so that 



Pa = TT c [p' c }=p^ + pl 1+ 0(e 3 ). 



(11) 



where Tr c represents a trace over the cavity mode. 

The equations of motion for the density matrix ele- 
ments pfj may then be obtained from the master equation 

for Pc, gi vm g 

dPoo = Ooo dt + iga{xp\ Q - p^ x)dt + jp^dt 

-*v^7(Pio - Pt - Wo " pllKa)dW 
+0(e% (12) 

dp% = C° mP a w dt - 2p a w dt 

+ig[x(ap a Q + p\ + v^aplo) - ap\ lX \dt 
-i^(V2p a - p\ x - Tr[p? - ptt]p & w )dW, 
+0(e 3 ) (13) 

dPn = Oil dt + iga{xpll ~ Pw^)dt - jp^dt 



+iVWr>Ko - pfMidW + 0(e 3 ), (14) 
dPw = Oto dt - 1P a 2Q dt + igx{2p\ Q + V2ap a w )dt 

+i^Tr[p a w - pltWW + 0(e 3 ), (15) 

where we have used 

Tr[-z( a - <J)p' c ] = TrHG^o - 4)] + O(^). (16) 



and defined 



£ l m p% dt ee -±[ff m - hg(\a\ 2 + l)x, pSJ] dt. 



(17) 



In order to write a SME for the motion of the atom we 
need to find a closed form equation for p a ss Pq + p\ x but 
the differential equations for the diagonal elements of the 
cavity mode density operator involve the off-diagonal el- 
ements. The adiabatic elimination exploits the difference 
in time-scales between the cavity and the motional dy- 
namics by assuming that the heavily damped off-diagonal 
elements have reached steady state values determined by 
the motional state. This will allow the off-diagonal ele- 
ments to be written in terms of the diagonal elements 
which will result in the desired SME. This is a little 
more complicated than the usual adiabatic elimination, 
because the off-diagonal elements are not merely strongly 
damped, but also contain a stochastic driving term. 

To adiabatically eliminate p| , we drop terms propor- 
tional to p 20 which are insignificant compared to the 
damping term, and obtain to leading order 



dpi 



~lP%®dt + iV2gaxp a Q dt 



(18) 



For the purposes of showing that the contribution from 
the stochastic driving is insignificant (ie. that it is not 
of leading order in e), we assume now that p a is con- 
stant. Since p a is actually stochastically driven as well, 
this is not exactly correct; in the steady state both off- 
diagonal elements will be fluctuating. However, the re- 
sulting analysis demonstrates that these fluctuations are 
higher order in e, so that setting p a to its mean value is 
self-consistent. With this assumption Eq.(|l8|) is just lin- 
ear multiplicative white noise p4| , with the stead-state 
solution 



(P2 



^ = \n\mpii-pU(p* 



211/ 



(19) 
(20) 



where <j 20 is the standard deviation of p 20 in the steady- 
state. Since p 3 ^ is first order in e, the fluctuations about 
the steady-state are of third order in e, while the aver- 
age value is of second order. We can therefore ignore the 
fluctuations to leading order in e, and write the result of 
the adiabatic elimination as 
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P20 



iV2{ 



go 



V 7 



xp a w + 0(e 3 ). 



(21) 



Proceeding in the same way to adiabatically eliminate 
pf , we find again that the fluctuations may be ignored 
to leading order, and obtain, 



l>\, ' (-) [xpl 



p a llX ] + 0(e 2 ), 



(22) 



Note that we have retained the term proportional to p^ , 
which is only third order in e. However, retaining this 
term is useful as this allows us to most easily recover the 
SME for the mirror. This result (Eq. (22)) confirms that 
Pi is indeed first order in e. The fact that pS is second 
order then follows immediately from Eq. (21), and that 
p\ x is second order follows from the fact that pf is first 
order. Thus the assertion made above regarding the scal- 
ing of the elements of p c is seen to be consistent with the 
regime (g). 

To obtain the stochastic master equation we now sub- 
stitute Eqs. (|l]) and (§|) into Eqs. (|l|) and (|l|) and 
combine them to obtain an equation for p a pg + p^. 
To leading order in e the resulting stochastic master equa- 
tion for the atom is 



dp a = -- [H n 



hg\a\ 2 x,p a ]dt 



-2kV[x]p a dt + y%kH[x]p a dW, 



(23) 



This is the expected form for a process describing the 
continuous measurement of position Note, how- 

ever, that an extra term proportional to x appears in the 
effective Hamiltonian. This is just the radiation pressure 
force on the mirror or the dipole force on the atom. As 
this is simply a classical linear force on the mirror, it may 
be cancelled by applying an equal and opposite linear po- 
tential along with the trapping potential, and we will as- 
sume that this is the case in all further analysis. For the 
case of a measurement on an atom, k = 2k^g^\a\' 2 / '(7 A 2 ) , 
while for the moving mirror k — 2g 2 n \a\ /j. This quan- 
tity may be referred to as the measurement constant, as it 
describes the rate at which information is obtained about 
the atomic position, and the corresponding rate at which 
noise is fed into the atomic momentum as a result of 
the measurement. Note that for unit efficiency detection 
and a pure initial state, the stochastic master equation 
is equivalent to a stochastic Schrodinger equation for the 
state vector ggg|§. 

The measurement signal is the photocurrent from the 
homodyne detection, being given by 



dQ = p[-rry(i(a - <f))dt + y/rjjdW], 



(24) 



where /3 is determined by the strength of the local os- 
cillator and the reflectivity of the beam splitter in the 
homodyne detection setup g]. Using Eqs.(|l6|) and p^ ) 
in Eq. (|24|) , we may write this as 



dQ = (3[2rj^k{x)dt + y/TfydW] 

= P^fif/JZk)[A' n k{x)dt + y/2rjkdW], (25) 

and defining a scaled measurement signal by dQ = 
dQ(yj2k/ (/3 2 7)), we may write 



dQ = 4r)k(x)dt + y^JkdW. 



(26) 



The scaled photocurrent, I(t) = dQ/dt, may then be 
written as 



I(t) = 4r)k(x) + V2ryfce(£), 



(27) 



where e(t) is the delta correlated noise source correspond- 
ing to dW. 



III. ESTIMATION AND FEEDBACK 

We come now to the central part of this paper, the 
question of how to employ a continuous measurement in 
the control of a quantum system. As we noted in the 
introduction an arbitrary integral over the whole pho- 
tocurrent could be used to modulate an arbitrary feed- 
back Hamiltonian. This large number of degrees of free- 
dom makes it difficult to motivate any particular feed- 
back scheme in real systems. Perhaps as a result both 
theoretical and experimental efforts in quantum feedback 
have focussed on feeding back the photocurrent at each 
moment in time. In this paper we refer to this as direct 
feedback. However, classical control theory (2^j2(| faces 
exactly the same problems and in this paper we propose 
a family of feedback algorithms which is adapted from 
strategies employed in analogous classical systems. Clas- 
sical strategies often break the search for useful control 
down into an estimation step and a control step. The 
powerful technique of dynamic programming is able to 
find optimal algorithms for both tasks in sufficiently sim- 
ple systems |25| . In general we propose that the estima- 
tion step for a quantum mechanical system will involve 
the solution in real time of an appropriate SME which 
accounts for realistic levels of measurement and process 
noise, using an initially mixed state reflecting lack of 
knowledge of the system. The state estimate which re- 
sults from the SME can then be used to modify the sys- 
tem Hamiltonian in order to achieve the desired control 
of the system. 

In the previous section we reviewed how the stochas- 
tic master equation (Eq.(p3[)), for the continuous posi- 
tion measurement of a single quantum degree of freedom 
may be derived from a real measurement process. Fortu- 
nately, if the Hamiltonian for the mechanical dynamics 
is no more than quadratic in the position and momen- 
tum, and the initial state of the system is Gaussian, the 
SME may be solved analytically, since it remains Gaus- 
sian at all times [^7| . Taking the initial state to be Gaus- 
sian is also sensible, because there is reason to believe 
that non-classical states evolve rapidly to Gaussians due 
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to environmental interactions, of which the measurement 
process is one example p8|Jj9f . A quantum mechanical 
Gaussian state is uniquely determined by its mean values 
and covariance matrix (see for example |3(if| ), just as is 
the case for classical probability distributions and so we 
only need to find equations for these variables in order to 
fully describe the evolution of the conditioned state. In 
what follows, we will refer to the elements of the covari- 
ance matrix, being the position and momentum variance 
and their joint covariance, simply as the covariances. The 
expectation values of operators are found from 



d{x) = A{x)dt + 2y^Jk, dY(t) 



(32) 



d{c) = Tr (cdp) 



(28) 



as for any master equation, see for example P2| . The 
Ito rules for stochastic differential equations |p4H result 
in equations for the covariances. Performingthis calcu- 
lation gives the equations for the means as |14|,|3l|| 



d(x) =--{[x, H m ])dt + 2W2r]kV x dW, 
n 

d{p) = ~(\p,H m ])dt + 2,/tojkC dW, 
and the equations for the covariances are 



(29) 



V~ = 



-~([x 2 ,H m }) + ^(x)([x,H m ]) ~ 8 V kV 2 , 
~{\p 2 ,H m }) + j(p)(\p,H m ]) + 2kh 2 - 8 V kC 2 , 



C = 



2h 



([xp + P x,H m ]} -8r)kV x C 



+~(x)(\ P ,H m ]) + ~(p)([x,H m }). 



(30) 



In these equations, V x and V p are the variances in posi- 
tion and momentum respectively, and 



C = - (xp + px) - (x) (p) 



(31) 



is the symmetrized covariance. The Gaussian assump- 
tion is required to obtain Eqs. (|3fj| ) but not Eqs. (29). 
These two systems of equations are precisely equivalent 
to the SME (23) under the assumption that the initial 
state is Gaussian. 

First of all it should be noted that, while it is not ex- 
plicit, the equations for the covariances are closed, in that 
they do not depend on the means, and, in addition, they 
do not depend upon the measurement signal. As a conse- 
quence the covariances at any point in time depend only 
upon the duration of the measurement, and not the spe- 
cific measurement record. These equations are instances 
of coupled Riccati equations, and may be solved analyti- 
cally HJl^l. The full solutions are fairly cumbersome, and 
we do not need to give them here. Once these equations 
have been solved, the solutions may be substituted into 
the equations for the means, and these are readily solved 
since they are merely linear equations with (stochastic) 
driving. Writing them in the form 



where (x) = ({x),{p)) T and dY(t) = (V x ,C) T dW, the 
solution is naturally just 



(x)(t) = e At (x)(0) + 2^/2 V ke At / e~ At dY(t'). (33) 

Jo 

During the measurement process two things happen. 
The first is that the mean position and momentum obey 
not only the evolution dictated by the Hamiltonian, but 
also suffer continual random kicks due to the measure- 
ment process. This is because, at each time step, the 
effect of the measurement process is to perform a 'weak' 
measurement of position, and since the result of the mea- 
surement is necessarily random, the position of the state 
in phase space changes in a random fashion g . The sec- 
ond effect of the measurement process, and the part that 
is governed by the deterministic equations for the covari- 
ances, is to narrow the width of the state in phase space. 
For ideal (unit efficiency) detection, an initial mixture 
is reduced, over time, to a completely pure state. At 
such a time there is, in that sense, no uncertainty, as the 
quantum state is completely determined, and remains so. 
For inefficient detection, the degree to which the state is 
mixed is also reduced during the measurement, but, in 
general, to a non-zero level determined by the detection 
efficiency JL4|| . 

Now let us consider how we might control the system 
by the process of feedback. In classical control theory one 
attempts to obtain, at each point in time, the best pos- 
sible estimate of the state of the system, and then uses 
the resulting estimates for the system variables in a feed- 
back loop to control the dynamics. Now, in the quantum 
example we are considering here, since the distributions 
for all the variables are always Gaussian, the mean posi- 
tion and momentum are also our best estimates of these 
variables. In fact, it would be quite reasonable to de- 
fine a continuous measurement of a system variable as a 
process by which we obtain an estimate of that variable 
at each point in time. Hence, by this definition, what 
we need to do to achieve a continuous measurement of 
a system variable, is to write down that integral of the 
measurement signal which gives us continuously our best 
estimate of that variable. The equations for the means 
are written above in terms of the Wiener process, rather 
than the actual measurement signal dQ. Rewriting them 
in terms of the measurement signal we have 



d(x) = --{[x, H m ])dt - 8nk{x)V x dt + 2V x dQ, 
n 

d{p) = ~^([p,H m ])dt - 8r)k(x)Cdt + 2CdQ, (34) 

so that the continuous position measurement does in- 
deed provide us with a continuous measurement of both 
position and momentum, in the sense introduced above. 
The strategy we have outlined here of only employing 
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the mean values and not the variances of the conditioned 
state in the control law turns out to be optimal in some 
classical systems which are termed separable. 

The above equations require that we put in a priori 
estimates of the state as the initial conditions. While we 
expect the initial state to be highly mixed (so that in 
that sense we initially have very little knowledge regard- 
ing the state) , it is assumed that one can obtain sensible 
estimates for the initial means and covariances. This as- 
sumption is certainly reasonable, for one can almost al- 
ways obtain good estimates of the initial density matrix 
from a knowledge of the way in which the system is pre- 
pared. This is just as true in classical estimation theory. 
In that case the initial values of the variables will be, in 
general, poorly known, but good estimates for the initial 
probability distributions for the variables (being analo- 
gous to the density matrix in the quantum case) may be 
obtained from a knowledge of the initial preparation. 

A further point to note is that after a sufficient time 
the best estimates of the variables actually do not de- 
pend upon the initial estimates, but only upon the mea- 
surement record. Hence, even an observer with very im- 
precise knowledge of the initial state will have obtained 
accurate information after a time, and the resulting feed- 
back, while perhaps initially of no great advantage, will 
eventually produce the desired effect. This property of 
the SME is shown in reference pM, where the question 
of estimation is considered in detail. As a final point re- 
garding the question of estimation and initial states, it 
worth noting that one can always wait a sufficient time 
for the covariances to attain their steady-state values be- 
fore initiating feedback, thus obviating to a certain extent 
the need to use initial estimates for the covariances. In 
classical control theory this is concern about the errors in 
state estimates is termed caution, and in linear systems 
with quadratic costs and Gaussian noises caution turns 
out not to be optimal. 

Curiously enough, Eqs. ( |34| ) do not admit of an ana- 
lytic solution in terms of dQ unless the covariances have 
their steady state values. This is because the linear equa- 
tion now has an explicit time dependence due to the fact 
that the time-dependent covariances multiply the means. 
However, it is a simple matter to integrate these equa- 
tions numerically, and a computer could perform the nec- 
essary calculations to obtain the best estimates of the 
variables in real time, and hence track the evolution of 
the system. This process of estimation is not only inter- 
esting because we can monitor the system evolution, but 
because the estimates may be used in a feedback loop to 
control the dynamics. 

Now that we know how to obtain the best estimates 
of the system variables, the process of feedback involves 
continually adjusting the Hamiltonian so that one or 
more of its terms are proportional to some function of 
these estimates. In treating this process of feedback we 
must be careful to ensure that the act of feedback (the 
act of adjusting the Hamiltonian) happens after the mea- 
surement at each time step. This is obviously essential, 



due to the fact that the measurement must be obtained 
before any adjustment based on that measurement can 
take place. However, in the limit of instantaneous feed- 
back this is very simple. First we consider the measure- 
ment step, in which the system evolves for a time dt, and 
the measurement signal is incremented by the amount 
dQ. At this point the feedback is allowed to act, and in 
the limit in which it is instantaneous (that is, much faster 
than any of the time scales which characterize the system 
dynamics), the Hamiltonian is updated before the next 
time step. At the next time step the equations of mo- 
tion for the estimates (and, in fact, all system variables) 
have the new Hamiltonian with the new values for the 
estimates, so that, effectively, the Hamiltonian has the 
desired value at all times. In the limit of instantaneous 
feedback, the SME describing the evolution of the sys- 
tem is therefore simply just as it was before, but with 
the Hamiltonian, H m , replaced with a new Hamiltonian, 
having specific dependencies on the estimates of x and p, 
which are simply Tr[xp a ] and Tr[pp a ]. The SME describ- 
ing the conditional evolution of the system, for general 
instantaneous feedback via estimation from a continuous 
measurement of position, is, therefore given by Eq. (p3|) , 
where H m is now a function of the average position and 
momentum: 

H m = f(x,p, Tr[xp a ], Tr[ppJ). (35) 

While the SME for the conditional evolution is therefore 
rather simple, particularly in that it is Markovian, an 
equation that would describe the overall average (non- 
selective) evolution would not be. This is because the 
average evolution at any given time is not simply a func- 
tion of the average density operator at that time, but 
depends on the previous history. We have however pro- 
vided a recipe to calculate the unconditioned state at 
all times since this only requires averaging over all the 
trajectories generated by our conditional feedback SME. 

We will now show that there is a precise analogy to be 
made between linear quantum mechanical systems (and 
in particular those subjected to continuous position mea- 
surement) and classical systems which are driven by a 
certain specified noise process. Once we define a quan- 
tum mechanical cost function, this precise analogy will 
allow us to identify the optimal feedback strategy by us- 
ing classical LQG theory. 

In the estimation step of classical control, a system 
called a Kalman filter |2f| is often used to obtain an es- 
timate of the state of the system from the measurement 
record. Where the system is linear, and the noise on the 
system (often referred to as plant or process noise) and 
the noise on the measurement are both Gaussian, this 
is the optimal state observer. There is, in fact, a noise- 
driven classical system with noisy measurement for which 
the Kalman filter turns out to be precisely Eqs. (|2^) and 
Eqs. (|3(]). Consider a classical harmonic oscillator with 
dynamical variables x c and p c , obeying the equations |T^ ] 

x c = p c /m (36) 



G 



p c = —mu} 2 x c + y/2r)kh£i (t), (37) 



dx c = Ax c + y/2rjk(0, l) T Ci (t)dt + Bu, (40) 



where the noise driving the classical system is delta cor- 
related so that (Ci(i)Ci(i')) = ^(^ — t')' an d the classical 
measurement result, being also noisy, is given by 

Q c =4 v kx c + ^k( 2 (t), (38) 

where (C2(i)C2(i')) = S(t — t') and Ci and C2 are uncorre- 
lated. The equations for the best estimates and their co- 
variances provided by the Kalman filter are then exactly 
the same as for the quantum system, with the identifica- 
tion 

dW = 2^/2^k(x c - (x c ))dt + Cadi, (39) 

which is referred to as the innovation or the residual. 
Hence, when the quantum states are Gaussian (and in 
that sense classical), the quantum measurement process 
may be viewed as a classical estimation process in which 
noise, Ci(^)> i s continually fed into the system to main- 
tain the uncertainty relations. It is important to note 
that the strength of the noise in this analogous classical 
process is determined by the accuracy of the measure- 
ment. Unlike classical systems where higher accuracy al- 
ways results in better state estimation, high accuracy in 
the position measurement will result in large momentum 
variance, and hence large average energy, of the condi- 
tioned states. 

The existence of an analogous classical system is very 
nice because it allows us to use results from classical con- 
trol theory when considering the quantum system. In 
particular, when the cost function is quadratic in the sys- 
tem variables, a separation theorem applies to the classi- 
cal system. This states that, given that it is the values of 
optimal estimates that being fed back, when calculating 
the feedback required for optimal control we may assume 
that the dynamical variables are known exactly p5| , p6| : 
there is no advantage in considering the accuracy of our 
estimate. In this case the restriction of feeding back only 
best estimates is justified as leading to the optimal strat- 
egy. Moreover, another result which applies to the classi- 
cal system states that the optimal control law will be the 
one which would be calculated by assuming there is no 
noise either on the plant or on the measurement. This 
stronger property is termed certainty equivalence. All 
that remains to be done to allow us to match the quantum 
with the classical theory is to consider a class of quan- 
tum mechanical feedback Hamiltonians and a quantum 
mechanical cost function, so as to complete the precise 
analogy between quantum and classical systems which 
exists for the Kalman filter and the SME. 

We now examine briefly the relevant results from classi- 
cal control theory. We note that classical optimal control 
theory has been applied in the past to closed (unmoni- 
tored) quantum systems by Rabitz and co-workers [p2[ . 
The classical system for which the Kalman filter is equiv- 
alent to the SME given by Eqs.(|37]) may be written 



where x c = (x c ,p c ) T are the classical variables. Here 
we have added feedback variables u, which will be cho- 
sen to be some function of the dynamical variables x c so 
as to implement control. Note that 'optimal' control is 
defined as a feedback algorithm which minimizes a cost 
function, which is usually a function of the system state. 
The role of the cost function is to define how far the sys- 
tem state is from the desired state during the process of 
feedback-control. Classical LQG control theory tells us 
that for linear systems, driven by Gaussian noise, and in 
which the cost function is quadratic in the system vari- 
ables (LQG stands for Linear, Quadratic, Gaussian), the 
optimal feedback is obtained by choosing u = —K(x). 
The form of the cost function is chosen to be 




and the optimal solution is the one that minimizes the 
expectation value J = E[7] of this cost function over 
the time that the feedback acts. It turns out that 
K = Q~ 1 B T U and although U is time-dependent, the 
steady state value is often all that is used in practice, 
and obeys the equation p5| 

= P + A T U + UA-UBQ- 1 B T U. (42) 

We will use these results in the next section when we 
consider cooling a single quantum particle. Note that 
the choice of cost function is crucial in determining the 
optimal strategy. For example, placing boundaries on 
the available strength of feedback, rather than using a 
quadratic cost function typically implies that some form 
of bang-bang control (a control algorithm in which u 
takes one of two values) is optimal. In this case the op- 
timal strategy is therefore not linear in the estimated 
state. 

So long as the feedback Hamiltonian given by Eq. ( |35| ) 
is linear in the position and momentum operators, so that 
it has the form 

H m = /(Tr[arp a ],Tr[pp a ])x -I- g(Tr[xp a ],Tr\pp a ])p (43) 

where / and g are arbitrary functions, then the dynamic 
equations for the covariances remain decoupled from the 
equations for the means, and remain deterministic. If this 
is not the case, then the equations for the means become 
coupled to those for the covariances, and the situation 
becomes more complex. Furthermore, if the feedback 
Hamiltonian has terms that are higher than quadratic 
in x and p then the correspondence between quantum 
feedback and LQG control will be lost since it will not 
preserve the Gaussian property of the states and since the 
quantum Hamiltonian will affect the state in a way that 
is distinct from the evolution of a classical probability 
distribution. 
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It remains to define a physically motivated quantum 
mechanical cost function which maps onto the classical 
system we are considering. Clearly the part of the cost 
function which refers to the performance of the system 
should be an expectation value of the unconditioned den- 
sity operator. It should also be quadratic in the position 
and momentum, since this ensures that it is straightfor- 
ward to minimize a typical measure of control such as 
the average energy, just as it does for a classical system. 
On the other hand, when assessing the cost of control, 
it will be sufficient to consider classical quantities since 
the feedback Hamiltonian will typically be modulated by 
essentially classical quantities such as electric currents or 
lasers powers. With these considerations, we can define 
a sensible quantum mechanical cost function as 



J q = f (Tr (x T Pxp) + (u T Qu) c ) dt 
Jo 



(44) 



Where () c indicates an average over the classical random 
variables u. As noted above the density matrix average 
can be performed by first taking expectation values over 
the conditioned states given by the SME and then aver- 
aging over the trajectories. The cost function for a given 
trajectory is 



((x) T P(x) + Tr (PV) + u T Qu) dt (45) 



where the angle brackets indicate that we are talking 
about the mean values and V is the covariance matrix 
of the conditioned states. The second term in the in- 
tegral is the expectation value of P for the conditioned 
state if the mean x is zero. This term is independent of 
both the mean values and the feedback, so long as we 
use a linear feedback Hamiltonian as discussed above. It 
represents a minimum cost due to the finite width of the 
conditioned states. For a given trajectory, u is not ran- 
dom, since it is a deterministic function of the current, 
and possibly of the past values of (x) . We can now re- 
express the quantum cost by averaging Eq.(45) over the 
trajectories, which gives 



Ja 



(((x) T P(x» c + Tr(PT/) + (u T Qu) c ) dt. (46) 



On the other hand, we have identified a classical prob- 
lem for which (x) and V obey the Kalman filter equations 
for the estimate of the noisy classical state x c . Since the 
Kalman filter is merely sufficient statistics for a posterior 
probability distribution for x c , we can write the average 
over x c in J in terms of this mean and covariance, giving 



J 



(((x) T P(x)) c + Tr (PV) + (u T Qu) c ) dt. (47) 



With this we see that the classical and quantum cost 
functions are identical. Thus the quantum mechanically 
optimal strategy (given that the feedback Hamiltonians 



are no more than quadratic) for the cost function we have 
introduced will be the classically optimal strategy for the 
fictitious classical system, whose Kalman filter equations 
reproduce the SME, under the analogous classical cost 
function. Moreover, it is clear that for each SME de- 
scribing a linear quantum system subjected to a linear 
measurement, there will be some classical LQG model 
which can be constructed to find the optimal feedback 
algorithm for a similarly defined quadratic cost. 

It is important to note that this is merely the opti- 
mal strategy for a given strength of measurement. For 
example, if the aim is to minimize the energy of an os- 
cillator, overly strong position measurement will result 
in states of high average momentum and therefore en- 
ergy. It is, however, straightforward to find the opti- 
mal measurement strength, if desired. One simply uses 
the procedure above to find the optimal strategy for a 
given measurement strength, k, and takes the extra step 
of optimizing the result over k. For the physical posi- 
tion measurements we discuss in section |n|, changing the 
measurement strength corresponds to changing the laser 
power driving the cavity. 

Up to this point we have not considered how partic- 
ular feedback Hamiltonians could be implemented, and 
so we complete this section with a discussion of this im- 
portant question. Clearly terms in the feedback Hamil- 
tonian proportional to functions of x are implemented 
by applying the required force to the system. By the 
use of estimation the forces can be adjusted so that they 
are proportional to any particular function of the average 
momentum and position as indicated above. This allows 
terms to be added to the dynamical equation for the mo- 
mentum, but not to those for the position. We will show 
in the following section that in order to achieve the best 
results for phase-space localization we must add terms to 
the dynamical equation for the position, and therefore it 
is important to be able to implement a term in the feed- 
back Hamiltonian proportional to momentum. This is 
not so straightforward, but we suggest two possible ways 
in which it might be achieved. If the exact location of the 
trap is not an important consideration, then shifts in the 
position (being strictly equivalent to a linear momentum 
term in the Hamiltonian) , arc achieved simply by shifting 
all the position dependent terms in the Hamiltonian, in 
particular the trapping potential. This is a shift in the 
origin of the coordinates, and, being a virtual shift in the 
position, produces a term in the dynamical equation for 
the position proportional to the rate at which the trap 
is being shifted. When the experimental arrangement 
is such that the distance covered by the particle during 
the cooling is negligibly small compared to the trapping 
apparatus this may prove to be a very effective way of 
implementing a feedback Hamiltonian linear in momen- 
tum. A second method would be to apply a large impulse 
to the particle so that during one feedback time-step the 
particle is moved the desired distance, and an equal and 
opposite impulse is then applied to reset the momentum. 
Naturally the feasibility of this method will also depend 
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upon the practicalities of a given experimental arrange- 
ment. 



IV. COOLING AND CONFINEMENT VIA 
FEEDBACK 

A. Using feedback by estimation 

Cooling and localization of individual quantum sys- 
tems is an important first step in the process of control. 
This is certainly true for trapped atoms, ions, and cavity 
mirrors which we used as our examples in section By 
cooling, we mean localization in momentum space, and 
by confinement we mean localization in position space. 
When these two processes are combined, then we may 
speak of phase-space localization. We now apply the for- 
mulation of quantum feedback introduced in the previ- 
ous section to the problem of phase space localization. 
As indicated in that section, we can use classical control 
theory to find the optimal feedback. First however, let 
us examine the steady state solutions for the covariances 
in the absence of feedback. For a harmonically trapped 
particle the equations for the covariances become P,H 



V x = (2jm)C ~8kj]V 2 
V p = -2mw 2 C - 8kr]C 2 



2kh z 



C = V p /m - muj 2 V x - 8krjCV x 
The resulting steady-state covariances are 



(48) 
(49) 
(50) 
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c = 



2rjmuj J + 1 
/ hmuj\ £ 



where 



muj 
2hnk 



(51) 



(52) 



Clearly the final state is in general a mixed Gaussian 
state, with the exact orientation and squeezing deter- 
mined by the measurement constant, oscillation fre- 
quency, particle mass and detection efficiency. From 
these covariances the purity of the final state is readily 
obtained by using M 



Tr[p 2 c } = (h/2)(V x Vp-C 2 y 1/2 - 



(53) 



We find that the steady state purity of the monitored 
state is 



For perfect detection efficiency (77 = 1), the state is there- 
fore pure, and perfectly determined at each point in time. 
For imperfect detection the state is not completely pure, 
and is increasingly mixed as the detection becomes less 
efficient. Inefficient detection also models environmen- 
tal noise, which in the case of a cavity mirror would be 
coupling to a thermal bath, and in the case of a atom 
would be spontaneous emission |14|. The equations of 
motion for the conditioned covariances are unchanged by 
linear feedback, whether direct or using estimation, and 
Eq. (|54|) therefore gives the lower limit on the purity of 
the final cooled state achievable for a given detector effi- 
ciency. 

Now we know the covariances and resulting purity of 
the conditioned state, we want to know how well we can 
localize the mean position and momentum of this state in 
phase space, by feeding back the estimated values. The 
stochastic equations for the means are 



d(x) = ((p) /m)dt + 2y^kV x dW, 
d(p) = -muj 2 {x)dt + 2^/2^kCdW. 



(55) 



We wish to minimize the distribution of (x) and (p) about 
the origin, and so it is sensible to take the cost function 
to be minimized as in the previous section 



J g = f (Tr(x T Pxp) +g 2 (u T Qu) 
Jo 



dt 



(56) 



where q is a weighting constant which in this case has 
units of time, and 



P = Q 



muj 2 
1/m 



(57) 



Trip 2 ] = Vv- 



(54) 



With this choice of P and Q the cost function is a 
weighted sum of the energy of the oscillator and a ficti- 
tious energy one could associate with the feedback vari- 
able u. In general one can choose any quadratic function 
of the feedback variables, and a particular choice would 
be made to suit a given situation. Note that the feed- 
back variable u appears in the cost function to reflect 
the fact that we are not unrestricted in the magnitude 
of the feedback we bring to bear. If this consideration 
is relatively unimportant, q is chosen to be small, and 
so the cost function reduces essentially to the energy of 
the oscillator, which is certainly the quantity we wish to 
minimize in the process of phase-space localization. 

With this form for the cost function, classical LQG 
control theory tells us that linear feedback will provide 
optimal control. Choosing B = I in the feedback equa- 
tion, so as to allow feedback in the dynamical equations 
for both variables (the most general case), we need merely 
solve Eq. ( f42| ) for V to find the optimal value of the feed- 
back matrix K . Performing this calculation we find that 
an optimal solution is K = (l/q)I. That is, feedback 
to provide an equal damping rate on both the position 
and momentum. Note that the smaller we make the 
weighting constant q, the larger the damping rate, being 







1/q. With this feedback, the dynamical 
2sf^kV x dW, 



V — V — V 

x x — x p x 

equations for the means become 
d(x) = -T x {x)dt + {l/m)(p)dt 



d(p) = -muj 2 (x}dt - T p {p)dt + 2^/2r]kCdW. 

It is now the mean and variance of the conditioned means 
which are of interest, as they tell us how well localized 
the particle is, and about what point in phase-space. We 
will denote the means of the conditioned means as ((x)) 
and ((p)), and the covariances of the means as V£, V° 
and C e , where the 'e' refers to the fact that they are 
excess to the quantum conditional covariances resulting 
from the measurement process. Clearly the steady state 
values for the means ((x)) and ((p)) is the origin of phase 
space, while the equations for the covariances are 



V e = 



V° = 



c c = 



2Q 


r(l 


+ 4Q 2 ) 




2Q 


r{\ 


+ 4Q 2 ) 




2Q 



v, 



e 

V — — 2F V e 



. 2 u,C c + -V x 2 

r x 

2u ;C c + —C 2 
r 



e _ O/jj 

c =-(T x + r p )c — lu(v° — v°) + —Cv x , 



(58) 
(59) 
(60) 



and the tildes denote dimensionless scaled covariances 
given by 



V - —V 



p Timuj 



v P ,c = -c. 



(61) 



Putting T x = T p = T, and solving for the steady-state 
covariances, we obtain 



r(l + 4Q 2 



1 + 2Q 2 ) V x 2 + 2Q 2 C 2 + 2QCV X 
1 2QC% 



2Q 2 V 2 + (l + 2Q 2 ) C 
-QV 2 + QC 2 + CV X 



(62) 
(63) 
(64) 



where Q = w/(2T) . 

The total average covariances resulting from the lo- 
calization process are simply the sum of the conditional 
covariances and these excess covariances. The overall re- 
sulting purity may then be calculated using Eq. j5^ ) , if so 
desired. 

In the previous section we noted that terms in the feed- 
back Hamiltonian proportional to momentum are harder 
to generate than those proportional to position, and since 
the optimal feedback we have used above requires both 
kinds of terms, it is of interest to examine what may be 
achieved with a position term alone. This imposes the 
condition that K\\ — K\2 — 0, where Kij are the ele- 
ments of the feedback matrix K . To derive the optimal 
solution under this condition we solve Eq.(42) as before, 
but this time set 



B 




1 



(65) 



only a position dependent term in the feedback Hamil- 
tonian, although this is clearly not as good as using a 
combination of position and momentum damping. 

To summarize our results so far, we see that when us- 
ing feedback by estimation, and when we average over the 
conditional evolution, there is an additional uncertainty 
in the final localized state over that due to measurement 
inefficiency, and that this excess uncertainty decreases 
with the magnitude of the feedback. This additional un- 
certainty is due to the noise which is continually fed into 
the system as the result of the measurement. The effect 
of this noise is decreased as the damping constant, T, is 
increased. However, there is ultimately a limit upon the 
magnitude of the feedback, and hence upon T, and this 
is reflected in the choice of the weighting constant q in 
the cost function. We will see in the next section that di- 
rect feedback provides an alternative strategy for dealing 
with the measurement noise. 



This time taking the small q limit, qcu = 2Q <C 1, an 
optimal feedback strategy is given by K 2 i = muj/q and 
K22 = 1/9- In this case the steady state solution for the 
excess variances are 



V c = - 

V X 

r 

v: 1 

r 



1 l>. 



1 X 

V 2 

c c = --v: 



AQ 2 C 2 + AQC% 



2QC A 



(66) 
(67) 
(68) 



We see, therefore, that using feedback by estimation, it 
is indeed possible to obtain phase-space localization with 



B. Adding direct feedback 

The beauty of direct quantum feedback, formulated 
by Wiseman and Milburn |l2] , is that it may be used to 
cancel the noise which drives the mean values of the dy- 
namical variables. This is possible because the noise in 
the measurement signal is the same noise that drives the 
system. Feeding back the measurement signal itself (by 
choosing a feedback Hamiltonian directly proportional to 
this signal) essentially allows the noise driving the system 
to drive it twice at each step. If the feedback Hamiltonian 
is chosen in the right way, then the effect of the noise at 
the first step may be canceled by that at the second step. 
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The result is that the steady state of the feedback master 
equation can have the same variances as the conditioned 
states since all of the fluctuations in the mean values are 
overcome. We note that direct feedback would be anal- 
ogous to the use of residual feedback in classical control 
theory (in which the innovation is fed back to drive the 
system) to cancel the noise driving the Kalman filter. 

As in the previous section, we choose the feedback 
Hamiltonian to be linear in x and p, as this is sufficient for 
our purposes. For direct feedback, the feedback Hamil- 
tonian is proportional to the measurement signal I(t), so 
we may write 

H D = I(t)(ax + /3p). (69) 
The stochastic master equation that results is |l2| 

dp a = -~[H m , p a ]dt + 2kV[x]p a dt + -V[F]p a dt 
h rj 

—iV2k[F, xp a + p a x]dt 

+H[^/2^kx - -!-F}p a dW, (70) 

where F = (^/2kri)(ax + 0p)/h. This is precisely the 
model that has been used previously to discuss the ma- 
nipulation of the motion of atoms and mirrors through 
feedback |l7|Jlq] . Applied to a harmonic oscillator and 
initially Gaussian states it is possible to rewrite this mas- 
ter equation in terms of the mean values and covariances 
exactly as was done above. The equations for the covari- 
ances are just as before (Eqs.(|5(])), but the equations for 
the means are now 

d(x) = ((p) /m)dt + Ar)kl3(x)dt + y/2rjk(2V x + (3)dW, 
d(p) = -muj 2 {x)dt - 4rjka{x)dt + ^/2~qk(2C - a)dW. 

We see that in order to cancel the noise driving the means 
we merely need choose the feedback Hamiltonian such 
that a = 2C and (3 — ~2V X . However, it is also clear 
that direct feedback from a continuous position measure- 
ment is limited in a way in which feedback by estimation 
is not. Using direct feedback alone it is not possible to 
provide a damping term for the mean momentum, or, in 
fact, any term in the equations for the mean values which 
is proportional to the mean momentum. This is because 
we are using continuous position measurement, so that 
the measurement signal is proportional to the mean posi- 
tion, and not the mean momentum. It is this limitation 
that feedback using estimation allows us to overcome. 
Further, it is also clear that while feedback by estima- 
tion allows us to achieve phase space localization even 
in cases where it was not possible to provide the feed- 
back Hamiltonian with a term involving the momentum 
operator, direct feedback alone will not provide either 
cooling or confinement without the use of a momentum 
term. Consequently, a momentum term in the Hamilto- 
nian is crucial for the cooling achieved in this system by 



the scheme of Mancini et al. (lj] . Alternatively, in the ab- 
sence of the momentum term, these equations are clearly 
well adapted to modifying the effective potential seen by 
the atom, and this is discussed at length by Dunningham 
et al. 

It is important to note that the feedback Hamiltonian 
Eq. ( |69| ) could only be realized in the limit of an ideal 
(infinitely broad band) feedback signal, due to the fact 
that the measurement noise is white noise (at least to an 
excellent approximation). The cost, J q , of such a signal is 
therefore also infinite. The use of linear direct feedback 
cannot improve the results obtained using feedback by 
estimation (since this is already optimal), and this is re- 
flected in the fact that it eliminates the noise only in the 
limit of infinite cost, a statement which is also true of the 
optimal algorithm using feedback by estimation. How- 
ever, direct feedback does provide an alternative strategy, 
and, depending on the limitations imposed by a specific 
implementation, it might well prove advantageous to use 
it in combination with feedback by estimation. 

To summarize, the lowest temperature available is 
given by the steady-state covariance matrix of the con- 
ditional states (Eqs.(|5l|)), and in the limit in which the 
cost of control can be disregarded, either direct feedback 
or LQG control, or some combination (being analogous 
to classical LQG control plus residual feedback), give a 
means of achieving this as a limiting case. The result 
is that the final cooled, localized state has the covari- 
ances given by Eqs.(^), and the resulting purity, given 
by Eq. (|54|) , would be limited only by the detection effi- 
ciency and environmental noise. 



V. CONCLUSION 

In this paper we have shown that it is possible to for- 
mulate, in a simple manner, feedback in linear quantum 
systems such that the best estimates of system variables 
are used to control the system. This significantly ex- 
tends the range of available possibilities for the control 
of quantum systems using feedback. Due to the fact that 
in linear systems the estimation process may be mod- 
eled by its classical analogue, Kalman filtration, classical 
LQG control theory may be applied to quantum feedback 
by estimation. 

While we have focused on applying results from LQG 
theory to linear systems, there are many other techniques 
from classical state observer based control which could be 
applied to control quantum systems. For example, the 
techniques of adaptive control, where system parameters 
are estimated on-line in order to cope with non-linearities 
or uncertainties about the system to be controlled, could 
be employed. Another problem to be faced in more com- 
plicated systems is the computational overhead in prop- 
agating the state estimate. Linear systems are tractable 
classically because the mean and covariance matrix pro- 
vide all the necessary information about the posterior 
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probability distribution with the result that the whole 
distribution need not be propagated. Propagating the 
SME's of non-linear quantum systems in real time will 
require extensive computational resources. Approxima- 
tions to the full SME, perhaps along the lines of classical 
extended Kalman filters, may well be useful or necessary 
in real near-future experiments. 



ACKNOWLEDGMENTS 

ACD would like to thank Hideo Mabuchi, Sze Tan and 
Dan Walls for helpful discussions, and the University of 
Auckland for financial support. KJ would like to thank 
Prof. Dan Walls for hospitality during a stay at the Uni- 
versity of Auckland where this work was carried out. 



[1] M. D. Srinivas and E. B. Davies, Optica Acta 28, 981 

(1981) . 

[2] A. Barchielli, L. Lanz and G. M. Prosperi, 72B, 79 

(1982) . A. Barchielli, Nuovo Cimento, 74B, 113 (1983); 
A. Barchielli and G. Lupieri, J. Math. Phys. 26, 2222 
(1985). A. Barchielli, J. Phys A: Math. Gen. 20, 6341 

(1987) . 

[3] N. Gisin, Phys. Rev. Lett. 52, 1657 (1984). Y. Salama 
and N. Gisin, Phys. Lett. A 181, 269 (1993) 

[4] L. Diosi, Phys. Lett. 114A, 451 (1986); L. Diosi, Phys. 
Lett. 129A, 419 (1988); L. Diosi, Phys. Lett. 132A, 233 

(1988) . 

[5] C. M. Caves, Phys. Rev. D 33, 1643 (1986). C. M. Caves 
and G. J. Milburn, Phys. Rev. A. 36, 5543 (1987); G. J. 
Milburn, Quantum Semiclass. Opt. 8, 269 (1996). 

[6] V. P. Belavkin, Nondemolition measurements, nonlinear 
filtering and dynamic programming of non-linear stochas- 
tic processes. In A. Blaquiere (ed.), Modelling and con- 
trol of systems, Lect. Notes in Control and Information 
Sciences 121, (Springer, Berlin, 1988) pp. 245-265; V. 
P. Belavkin and P. Staszewski, Phys. Lett. A 40, 359 

(1989) ; V. P. Belavkin and P. Staszewski, Phys Rev. 
A 45, 1347 (1992); see also the review, A. Barchielli, 
Int. J. Theor. Phys. 32, 2221 (1993); D. Chruseiriski P. 
Staszewski, Physica Scripta 45, 193 (1992). 

[7] P. Zoller, M. Marte and D.F. Walls, Phys Rev A 35 198 
(1987); J. Dalibard, Y. Castin and K. Molmer, Phys. Rev. 
Lett, 68, 580 (1992). C. W. Gardiner and A. S. Parkins 
and P. Zoller, Phys. Rev. A 46, 4363 (1992). 

[8] H. J. Carmichael, S. Singh, R. Vyas and P. R. Rice, 
Phys. Rev. A, 39, 1200 (1989); H. Carmichael, An Open 
Systems Approach to Quantum Optics (Springer- Verlag, 
Berlin, 1993). 

[9] H. M. Wiseman and G. J. Milburn, Phys. Rev. A 47, 642 

(1993) . 

[10] C. M. Caves, K. S. Thorne, R. W. P. Drever, V. P. 
Sandberg and M. Zimmerman, Rev. Mod. Phys. 52, 341 
(1980). 

[11] H. Mabuchi, J. Ye and H. J. Kimble , (submitted to Appl. 

Phys. B) eprint: |quant-ph/9805076| . 
[12] H. M. Wiseman and G. J. Milburn, Phys. Rev. Lett. 

70, 548 (1993); H. M. Wiseman, Phys. Rev. A 49, 2133 

(1994) ; 49, 5159(E) (1994); 50 4428(E) (1994). 

[13] J. G. Walker and E. Jakeman, Proc. Soc. Photo-Opt. 
Instrum. Eng. 492, 274 (1985); S. Machida and Y. Ya- 
mamoto, Opt. Commun. 57, 290 (1986); M. S. Taubman, 



[14 

[is; 

[ie; 
[17; 
[is; 

[19 
[20 
[21 
[22 
[23 
[24 
[25 

[26 



[27 
[28 



[29 
[30 
[31 

[32 



H. Wiseman and D. E. McClelland and H. A Bachor, J. 
Opt. Soc. Am. B 12, 1792 (1995). 

A. C. Doherty, A. S. Parkins, S. M. Tan and D. F. Walls, 
(in preparation). 

B. E. King, C. S. Wood, C. J. Myatt, Q. A. Turchette, 
D. Leibfried, W. M. Itano, C. Monroe, D. J. Wineland, 
Phys. Rev. Lett. 81, 1525 (1998). 

S. Bose, K. Jacobs and P. L. Knight, P hys. Rev. A 56, 



4175 (1997), eprint: fcuant-ph/9708002 
J. A. Dunningham, H. M. Wiseman and D. F. Walls, 
Phys. Rev. A 55, 1398 (1997). 

S. Mancini, D. Vitali, a nd P. Tombesi, Phy s. Rev. Lett. 
80, 688 (1998), eprint: | quant-ph/9802034|; S. Manc ini, 
D. Vitali, and P. Tombesi, eprint: |quant-ph/9810022 . 
G. J. Milburn, K. Jacobs and D. F. Walls, Phys. Rev. A 
50, 5256 (1994). 

K. Jacobs, I. Tittonen, H. M. Wiseman and S. Schiller, 



(in submission), eprint: quant-ph/990204C . 

A. Pace, M. J. Collett and D. F. Walls, Phys. Rev. A 47, 
3173 (1993). 

D. F. Walls and G. J. Milburn, 'Quantum Optics' 
(Springer- Verlag, Berlin, 1994). 

G. M. Meyer, H. J. Briegel and H. Walther, Europhys. 
Lett. 37, 317 (1997). 

C. W. Gardiner, 'Handbook of Stochastic Methods' 
(Springer- Verlag, Berlin, 1985). 

O. L. R. Jacobs, 'Introduction to Control Theory' (Ox- 
ford University Press, Oxford, 1993); P. Whittle, 'Opti- 
mal Control' (John Wiley & Sons, Chichester, 1996). 

B. D. O. Anderson and J. B. Moore, 'Optimal Control: 
Linear Quadratic Methods' (Prentice-Hall, Englewood 
Cliffs, New Jersey, 1990). 

K. Jacobs and P. L. Knight, Phys. Rev. A 57 2301 (1998). 
M. Rigo, F. Mota-Furtado and P. F. O'Mahoney, J. Phys. 
A 30, 7557 (1997); B. M. Garraway and P. L. Knight, 
Phys. Rev. A 50, 2548 (1994). 

W. Zurek, S. Habib and J. P. Paz, Phys. Rev. Lett. 70, 
1187 (1993). 

C. W. Gardiner, 'Quantum Noise' (Springer- Verlag, 
Berlin, 1991). 

J. K. Breslin and G. J. Milburn, Phys. Rev. A 55, 1430 
(1997); J. Halliwell and A. Zoupas, Phys. Rev. D 52, 
7294 (1995). 

A. P. Peirce, M. A. Dahleh and H. Rabitz, Phys. Rev. A 
37, 4950 (1988); W. S. Waren, H. Rabitz and M. Dahleh, 
Science 259, 1581 (1993); A. Bulatov, B. Vugmeister, A. 
Burin and H. Rabitz, Phys. Rev. A 58, 1346 (1998). 



12 



